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Abstract —The rapid increase in data traffic demand has 
overloaded existing cellular networks. Planned upgrades in the 
communication architecture (e.g. LTE), while helpful, are not 
expected to suffice to keep up with demand. As a result, extensive 
densification through small cells, caching content closer to or even 
at the device, and device-to-device (D2D) communications are 
seen as necessary components for future heterogeneous cellular 
networks to withstand the data crunch. Nevertheless, these 
options imply new CAPEX and OPEX costs, extensive backhaul 
support, contract plan incentives for D2D, and a number of 
interesting tradeoffs arise for the operator. In this paper, we 
propose an analytical model to explore how much local storage 
and communication through “edge” nodes could help offload 
traffic in various heterogeneous network (HetNet) setups and 
levels of user tolerance to delays. We then use this model to 
optimize the storage allocation and access mode of different 
contents as a tradeoff between user satisfaction and cost to 
the operator. Finally, we validate our findings through realistic 
simulations and show that considerable amounts of traffic can 
be offloaded even under moderate densification levels. 

I. Introduction 

The growth in the number of “smart” mobile devices and 
connection speeds has led to a high volume of mobile data 
traffic. Cellular networks are currently overloaded and, despite 
a lot of planned improvements on the physical layer technolo¬ 
gies, they are not expected to be able to keep up with the 
rapidly increasing user data demand JT|. Radically reducing 
the communication distance by deploying, and offloading 
traffic to, many “small cells” (e.g. femto, pico, or even WiFi) 
is seen as the only viable solution O— O. Nevertheless, this 
requires a large investment in upgrading the backhaul network, 
increasingly based on wireless links, which will often be the 
new performance bottleneck 0. Caching popular content at 
the “edge”, i.e. on storage devices installed at small cell base 
stations could alleviate backhaul congestion 0, 0, and is 
supported by a number of real data studies suggesting a high 
amount of demand overlap between user requests 0-0. 

Reducing the communication distance is taken yet a step 
further with the newly proposed paradigm of device-to-device 
(D2D) communication Co), CD. A device can store a (popu¬ 
lar) content after consuming it, and give it directly to other 
neighboring devices also interested in it, offloading these 
requests from the main network. The connection between 
the two devices could be in-band (cellular frequencies) or 
out of band (e.g. Bluetooth, WiFi Direct). While D2D-based 


offloading normally assumes a content request will either be 
served immediately from a device currently in range or the 
cellular network, some recent works have suggested the use 
of opportunistic offloading through D2D: a device requesting 
some content might wait for some amount of time until it 
encounters another device sharing the content m-m, and 
go back to the main network if not found before some set 
deadline. 

Hence, more data could be offloaded from the main network 
through such D2D communication, perhaps at the expense of 
increased delay for some requests. Such increased delays could 
sometimes be acceptable (e.g. asynchronous requests, longer 
start-up or buffering delays easily amortized when considering 
large content). Yet, in many cases, the operator will need to 
provide appropriate incentives to these users, either in the form 
of instantaneous price reductions fl5ll or low(er) priced plans. 
What is more, operators will probably need to also provide 
incentives to the devices storing the content and acting as 
local relays on their behalf, as this raises important battery 
consumption, storage, as well as privacy and security issues. 

The provision of these incentives constitutes another im¬ 
portant form of cost for the operator, together with the costs 
of directly serving the content from the main (mostly macro¬ 
cell based) network, and that of installing, maintaining, and 
supporting with ample backhaul capacity, new small cells with 
large enough caches. It thus becomes increasingly important 
for an operator of such a future Heterogeneous Network 
(HetNet) with caching and D2D capabilities to be able to 
answer questions like: ’’How much content can be offloaded 
by a given setup as a function of content demand patterns?”, 
”Is it worth investing in additional cell densification, or 
would it be more cost-efficient to provide incentives for D2D 
opportunistic offloading?". 

To this end, in this paper we propose an analytical model 
that can be used to study the problem of ’’offloading on the 
edge” in a HetNet. Although capturing all the fine details of 
possible setups and technologies would be a rather daunting 
task, we assume two main mechanisms being employed in 
the considered network, namely (i) caching on small cells 
and mobile devices, collectively referred to as ’’edge nodes”, 
and (ii) offloading requests through local, short range com¬ 
munications (e.g. D2D or low power communication to local 
femto or pico base stations). We describe the ’’offloading on 


the edge” mechanism and propose a generic model that allows 
us to analytically study it (SectionUTJ). We proceed by deriving 
useful results for the performance of content delivery through 
this mechanism and the incurred costs, as a function of key 
system parameters (Section ED Then, we study the total 
offloading cost and provide insights for content placement and 
dissemination strategies that minimize this cost (Section HVI) . 
Finally, we validate our results through realistic simulations 
(Section [V]) and discuss related work (Section [Vllb . 

Summarizing, the main contributions of our work are: 

« To our best knowledge, this is the first work jointly 
and analytically studying offloading through small cells, 
opportunistic D2D, and caching at both. 

« We provide closed-form analytical approximations appli¬ 
cable to a number of performance metrics and network 
setups. 

« We provide initial insights into the various design trade¬ 
offs involved, as well as the efficient allocation of storage 
space among different contents. 

II. Offloading on the Edge 

A. Network Setup 

We consider a Heterogeneous Cellular Network (Het- 
Net) 0, composed of 3 sets of nodes: 

Macro-cell Base Stations (BS ): They provide full coverage 
to subscribed mobile nodes (MNs), but we assume their radio 
resources are congested. 

Small Cells (SC): These are shorter range, low power base 
stations (e.g. femto and pico-cells, or even WiFi access points) 
dispersed in the area of coverage. They provide ample capacity 
to the few MNs within range, and their communication cost 
to/from a MN is smaller El. Hence, they can be used 
to offload some traffic from BSs. However, the backhaul 
connection for these cells will often be wireless (either to a 
BS or to an aggregation point) and underprovisioned 0. This 
makes a backhaul transmission to a small cell costly. To this 
end, each small cell is equipped with some storage capacity, as 
in 0, 0, where (popular) content could be cached to avoid 
duplicate backhaul accesses. 

Mobile Nodes (M.N): These include smartphones, tablets, 
netbooks, etc. MNs can communicate with BSs, SCs (if in 
range), and even other MNs directly, if D2D communication 
is allowed. D2D communication potentially offers higher rates 
at lower interference levels GDI. Yet, appropriate incentives 
from the operator might be needed. Without loss of generality, 
we assume out-of-band communication (e.g. WiFi Direct or 
Bluetooth) for D2D. We also assume that each MN also has 
some storage capacity (normally less than that of a small cell) 
for caching (popular) content. 

The number of nodes in each set is 

N bs = \BS\ , Nsc — |<5C| , Nm N = \MN\ 
where | • | denotes the cardinality of a set. 


B. Offloading Mechanism 

Content Requests. We assume that each MN is interested in 
different contents over time (e.g. videos, web pages, software 
updates, etc.), and that the same content may be of interest 
to multiple MNs. This interest overlap is supported by recent 
studies (e.g. 0-0, to name a few), where the popularity 
distribution of contents is shown to be highly skewed. In the 
remainder, we will be assuming that the number of nodes 
interested in a content, the content popularity, is known in 
advance or can be estimated. For a number of applications, 
like push services ED, this information can be known in 
advance by the cellular network. Users are subscribed to a 
push service they are interested in (e.g. news, series episodes, 
trending videos, etc.), and when a content (of this service) is 
created or published, the content provider starts distributing 
(pushing ) it to thcrrQ. Similarly, users might subscribe to 
certain categories of contents, such as personalized Internet 
radio stations like Pandora and Jang<H The content of these 
pseudo-random streams of songs can be decided in advance, 
and thus the popularity of songs belonging to different streams 
can be estimated. 

Content Delivery. An operator can deliver a content to 
an interested MNs in one of the following ways: (i) Direct 
transmission from a BS; (ii) Offloading through SCs and/or 
MNs. where the operator transmits the content to some SCs 
over the backhaul and stores it there, or instructs some MNs 
to store a content for some time (e.g. keeping in their cache 
a content they consumed). Then, the operator can ask an 
interested MN within range of a SC or MN caching that 
content to retrieve it directly. 

Moreover, an operator can ask an MN interested in a content 
6, but not currently within range of an SC or MN with content 
9 in its cache, to wait for an amount of time, let TTL, until 
it moves within range of such an SC or MN. If this time 
expires, then the operator is obliged to deliver the content 
directly through the closest macro BS. While this delay- 
tolerant approach is in contrast to the usual ones considered for 
small cell and D2D based offloading 0, 0, ifTTI . it is likely 
that the small cell and (D2D enabled) mobile node density 
will not always be enough to offload enough traffic. Hence, it 
is a valuable (and complementary) alternative, with potential 
benefits (increased offloading) and costs (reduced QoE and 
potential monetary incentivesjj. 

C. Cost Model 

The goal of an offloading mechanism is to minimize the cost 
of delivering a set of contents over time to different nodes. 
Hence, we need first to define a model for the costs involved 

1 We assume that the content provider may be the cellular network operator 
itself or in cooperation with it (like the Akamai and Swisscom example H3). 

i www.pandora.com , www.jango.com 

3 Clearly, such delays might not be acceptable for all applications. However, 
many applications are inherently delay-tolerant, e.g. software updates, file 
downloads, one way streaming (e.g. YouTube or Netflix). Moreover, users 
might be willing to accept small or larger delays, if appropriate incentives 
are provided, and delayed content delivery has already been considered in a 
number of contexts, e.g tm, odd 


in each phase of the ’’offloading on the edge” mechanism. 

— Initial Placement Costs: Cbh, Cbs- 

The content provider, at time to = 0, places the content to 
some edge nodes (SCs and/or MNs). A content is placed to 
a SC through a backfiaul (wired or wireless) transmission, 
and we denote this per placement cost as Cbh- A (possible) 
content placement to some MNs takes place through a macro¬ 
cell BS transmission. We denote this transmission cost, which 
mainly depends on the load/congestion of the BSs, as Cbs- 

— Opportunistic Offloading Costs: Csc> Cd 2 D- 

During time t £ (0, TTL], the holders (which are either SCs 
or MNs) deliver the content to any requester they meet. We 
consider different costs for a SC-MN and a MN-MN (or D2D) 
transmission: Csc and CD 2 D- The former cost depends on the 
operating cost (transmission, energy consumption) of an SC, 
whereas the latter might exist if a compensation (or reward) 
is given by operator to MNs for each content they offload. 

Delayed Delivery Cost: C^g L \ 

At time TTL, the cellular network sends through macro¬ 
cell BSs the content to every non-served requester. This cost 
relates both to the load of BS (as Cbs ) and to a (possible) 
compensation to the MNs for a delayed delivery. We denote 
this (per transmission) cost as C^g L \ 

D. Content Dissemination Model and Assumptions 

Let us assume a content item (e.g. a popular video file) 
and a set of MNs interested in it. The content provider, at 
time to = 0, places the content to the caches of some SCs 
and/or MNs. If by an expiry time TTL (if any), some of the 
interested MNs have not met any edge node (SC or MN) with 
the content, they are served by a macro-cell BS0. 

For the ease of reference, we define the following sets of 
’’edge nodes” that are involved in the offloading process: 

Definition 1. A requester of a content is a mobile node (MN) 
that (a) is interested in the content and (b) has not received 
it yet. We denote the set of requesters at time t as 7 Z(t). 

Definition 2. A holder of a content is an edge node (SC 
or MN) that stores the content and will forward it to its 
requesters. We denote the set of holders at time t as Tl(t). 

We further denote the number of requesters and holders as: 

R{t) = \K{t)\ and H(t) = \H(t)\ 

where Tl(t) = Hscif) CT-Lmn^) and H(t) = Hscif) + 

To model the level of participation of MNs in the offloading 
mechanism, we make the following assumption. 

Assumption 1 (Cooperation). A requester acts as a holder 
for a content it has received with probability p c £ [0,1]. The 
probability p c is equal among all nodes and contents. 

4 In the mechanism we consider, the content is cached only at the initial 
time, to = 0, and macro-cell BSs deliver it only at its expiry time , t = TTL. 
Although one could place contents during time t £ (0, TTL) as well, it has 
been shown (for similar settings) that placing contents at times t £ (0, TTL) 
leads to a sub-optimal performance [13, (H). 


The probability p c captures either the chance a node to 
forward the content (e.g. it has enough resources at the time) 
or the percentage of nodes who are ’’contracted” to helpd 

Finally, since edge nodes can exchange data only when 
they come within transmission range, the offloading is heavily 
affected by these meeting events between nodes. We assume 
the following class of node mobility. 

Assumption 2 (Mobility). 

— The meeting events between two nodes {/, j}, i £ MM and 
j £ MM U SC, are given by a Poisson process with rate A ij. 

— The meeting rates A ij are drawn from an (arbitrary) 
probability distribution f\ (A) with mean value p\. 

— Meeting duration is negligible compared to the time inter¬ 
vals between nodes, but long enough for a content exchange. 

Assumption [2] is a tradeoff between realism (heterogeneous 
A ij) and tractability (Poisson process). Heterogeneous meeting 
rates are motivated by analysis of real mobility traces ED, 
ll22l . where not all people meet each other with the same 
frequency, and by the different communication ranges (SC- 
MN and MN-MN). Similar assumptions are common in related 
works Ea-iH, is, eu and have been shown to not be far 
from real mobility OTl . 11221 . Yet, in Section [V] we test our 
results against realistic scenarios where node mobility departs 
from our assumptions and involves much more complexity. 

III. Analysis 

An operator, in order to optimize the offloading performance 
and cost, has to weigh its options and take decisions about: 
how to deliver each content (directly or through offloading), 
how many copies of a content should be placed to different 
edge nodes, which contents to store in the SC and/or MN 
caches when their capacity is limited, etc. To this end, in this 
section, we provide the analytical results that are needed when 
trying to answer these questions. Specifically, we provide 
simple, closed form expressions for the performance of the 
’’offloading on the edge” mechanism (Section IIII-Ab . and the 
costs it incurs (Section IHI-Bb . 

A. Content Dissemination 

The performance of the “offloading on the edge” mechanism 
depends on how much traffic it can offload and/or how fast are 
contents delivered. To answer these questions, we calculate the 
two main (and most common) performance metrics, namely 
the content delivery probability, and content delivery delay. 

First, we state the following Lemma, in which we use a 
mean field approximation and a resulting system of ODEs to 
approximate the number of holders and requesters over time. 

5 Here, we need to stress that the above assumption implies that MNs will 
never become holders of a content they are not interested in. Although there 
exist studies that assume that even not interested MNs might be willing to 
act as holders G3, m, OH, Hi, we believe that incentive mechanisms for 
these cases are difficult to implement (e.g. a user easier accepts to forward 
a content it already has stored, than to retrieve, cache and forward a content 
it will never use). Nevertheless, our framework could be easily extended also 
for such cases. 







Fig. 1: Content dissemination modeled by a Markov Chain. 


Lemma 1. The fluid-limit deterministic approximation for the 
expected number of holders (H(t)) and requesters (R(t)) at 
time t, is 

(p c -R 0 + H 0 )-e^<Pc R o+ H o)t 
U °' p c - R 0 + H 0 ■ e^<Po R o+H 0 )-t 
D _ Pc' Rp+ Hq _ 

[ 1 0 ' p c ■ Ro + Ho ■ e^<Pc- R o+Ho)-t 

where Ho = H{ 0 + ) and Ro = R( 0 + ). 

Proof Having assumed Poisson meeting processes, we can 
model the dissemination of a content with a continuous 
Markov Chain, whose states correspond to the different sets 
of holders and requesters {77,72}. Fig. |T] shows a segment of 
this Markov Chain; we present the different states with equal 
number of holders (|77|) and requesters (|72|) under the same 
group, which can be described by the tuples {|77|, |72|}. To 
transition between states a content delivery, which takes place 
when a holder i GH and a requester j £ 72 meet, is needed: 

(i) Content delivery to cooperative node. The next state is 
{|77| = m + 1, |72.| = n — 1} and the transition rate 

•\m,,ra)->-(m+l,ra-l) = Pc ' YieW. YjQTl (1) 

(ii) Content delivery to non-cooperative node. The next state 
is {|77| = to, 172.| =n—l} and the transition rate 


1) — (1 Pc) ' Yieti YjeK Yj 


( 2 ) 


Statistics for the content dissemination process over time 
(e.g. distribution of |77(7)| or |72.(i)|), can be computed using 
the transition matrix of the Markov Chain of Fig. [I] However, 
this would render the problem analytically (and numerically, 
for large networks) intractable. To this end, we approach the 
problem with a mean field approximation of stochastic reaction 
models lf24l . 

We first form the drift equation li24l Theorem 1.4.1] for the 
expected number of holders, E [\H (i)|] = E [77(7)], as: 


dE [77(7)] 
dt 


R [^(m,n)—1)] — Pc ' R 


i&'Hjen 


The expectation in the right side of the drift equation is 
difficult to compute, as it requires the computation of the 


probabilities over the whole state space {H,1Z}. To this end, 
one can approximate E[H(t)\ with its deterministic equivalent 
7(7). This approximation comes after neglecting the variability 
of H (7) around its mean value and becomes more accurate for 
larger systems ll24l Section 1.5]. 

Based on the deterministic approximation and since (a) 
the rates Ay are drawn independently from a distribution 
fx( A) with mean value p,\ (77[Ay] = p\), and (b) the sum 
Yien YjeK Yj consists of |77| • |72| terms, we can write 


E 


Yi^n Yjen Yj 


h(t ) • r(t) ■ p\ 


(3) 


The higher the number of terms in the above sum, and the 
less the heterogeneity of the meeting rates (i.e. the variance 
of /a (A)), the more accurate the approximation in Eq. (Q} is. 

Substituting Eq. (0 in the drift equation (where // (t) —> 
h(t)), gives the ordinary differential equation (ODE) for h(t f^ 

- = Pc ■ h(t ) • r(t) ■ p,\ (4) 

dt 

Proceeding similarly, the ODE for the deterministic approx¬ 
imation of the number of requesters (R(t) —> r(f)), is 

= ~h(t) ■ r(t) ■ p\ (5) 

Finally, solving the system of the ODEs of Eq. (|4]i and 
Eq. 0, gives the expressions of Lemma [j] □ 


Based on Lemma Q] we, now, proceed to the calculation 
of the performance metrics. Let us consider a requester i £ 
72(0 + ), and denote as T, the time it receives the content. The 
probability this (random) requester to receive the content by a 
time t, i.e. P{X) < t), is equal to the percentage of offloaded 
contents by time t. Hence, we can write 


P{Ti <t} = 


R-o — R(t) 

Ro 


= 1 - 


m 

Ro 


( 6 ) 


Substituting the expression of Lemma [T] in Eq. 0, gives the 
following Result for the content delivery probability 


Result 1 (Delivery Probability). The probability a content to 
be delivered to a requester by time t is given by 


P{T d <t} = 1 


Pc ■ Ro + Ho 

Pc- R 0 + H 0 - e^-(pcflo+ffo)t 


where Hq = H(0 + ) and Ro = R( 0 + ). 


With respect to the average delay a requester experiences 
till it receives the content, we state the following Result (the 
proof is technical and can be found in Appendix©. We derive 
expressions for two cases: (a) the content does not expire 
(i.e. TTL —> oo), and (b) a macro-cell BS serves undelivered 
contents at time t = TTL. . 


6 Note the differences between H(t) and hit): (a) H(t) is integer, whereas 
h(f) is a real number; (b) the drift equation for H(t) contains expectations, 
while the respective ODE for h(t) does not. 
















Result 2 (Delivery Delay). The expected content delivery 
delay, under an expiry time TTL £ [0, oo), is given by 
- for p c > 0: 


E[T d \TTL\ 


In 


/ p c -R 0 - e -Ux-(Pc-Ro + H 0 )-TTL \ 

\ + H 0 +p c ■ R 0 ■ e -U\-(Pc R 0 +H 0 )-TTL J 

px-p c - Ro 


- for p c = 0: 


E[T d \TTL\ 


\ _ g-UX'Ho-TTL 

Px - Ho 


where Hq = H( 0 + ) and Rq = f?(0 + ). 


B. Content Delivery Cost 

Incorporating the offloading costs (Section 111-Ct in our 
content dissemination model, and using the analytical results 
of Section IIII-AI we calculate the cost of a single content 
delivery in Result [3] The expression we derive, gives the cost 
as a (simple) function of the system parameters (e.g. Rq, px) 
and the operator selected parameters (e.g. Hsc(0), -Hmjv(O)), 
providing, thus, the necessary information for the evaluation 
and tuning of the “offloading on the edge” mechanism. 

Result 3. The cost of “offloading on the edge” a content is 
given by 


C =C B h ■ H S c{ 0) + C B s ' H mn ( 0) 

+ (Csc ■ q + C D2D ■ (1 - q)) ■ R 0 ■ P{T d < TTL} 

+ C ( bs L) 'Ho-(l- P{T d < TTL}) 

H (o') 1 ( H( ~ TTL '> ^ 

where q = Z { Ro-R(tt° L)) • ™d P{T d < TTL}, H(TTL) 
and R{TTL) are given in Lemma\T}and Result}]} 

Proof. 

— Initial Placement. The first two terms correspond to the 
initial placement phase: The cellular network operator, at 
time t = 0, places the content to Hsc{ 0) SCs and Hmn(0) 
MNs; in total (Hq = Hsc(0) + Hmn{ 0)) holders. The costs 
per content placement are Cbh and Cbs, respectively. 

— Opportunistic Offloading. During the opportunistic offload¬ 
ing phase, i.e. t £ (0, TTL), the average number of requesters 
that receive the content by an edge node is Ra-P{T d < TTL}. 
If we denote with q the percentage of requesters that receive 
the content by a SC, it is easy to see that the costs due to 
SC-MN and MN-MN content deliveries are 


Csc-q-Ro- P{T d < TTL} (7) 

C D2 d ■ (1 - q) ■ Ro ■ P{T d < TTL} (8) 


respectively. 

To calculate the percentage q we proceed as following: 

At first, the total number of requesters that receive the 
content by time TTL is 


(9) 


Second, the total number of requesters that receive the 
content in the interval (t,t + dt\, t £ (0 ,TTL) is 

R(t) — R(t,t + dt) = —dR(t) (10) 


The probability that a content delivery that takes place in the 
interval in the interval (f, t + dt] is due to a SC is equal to 


H SC { 0) 
H(t) 


£[ 0 , 1 ] 


( 11 ) 


where ITsc(0) is the number of SC holders (which does not 
change over time), and H(t) the total number of holders at 
time t. 

Therefore, the number of requesters that receive the content 
by an SC holder in the interval (t,t + dt\ is given by — dR(t) ■ 
g g(£) , and the total number of requesters that receive the 
content by an SC holder by time TTL is 

*Rsc = J 0 TTL —dR(t) ■ Mfogl = f o TTL -«m. . Sfojp.. dt 

Eq =J *f™ L H (t)-R(t)-px-Z§ffi-dt 

= Px-H sc (0)fn TL R(t)-dt (12) 

Using the expression of Lemma |T] for R(t) to calculate the 
above integral, we get 


#Rsc 


Hsc( 0) 

Pc 


■ In 


Hsc( 0) ln 

Pc 


/ (p c ■ Rq + H 0 ) ■ e Ux-{Pc-R 0 +H 0 )-TTL\ 

\ Pc-Rq+Hq- e^'(Pc «o+ffo) TTL J 


/ H(TTL)\ 

l Ho J 


(13) 


where the last equality follows from the expression for H(t) 
given in Lemma |T] 

Now, q easily follows from Eq. @ and Eq. 


, / HITTL) \ 

#Rsc H sc ( 0) 

#Rtot Pc Ro - R(TTL) 


(14) 


— Delayed Delivery. Finally, the average number of requesters 
that do not receive the content before its expiry time, is given 
by Rq ■ (1 — P{T d < TTL}). Since, the cost of each content 
transmission at time t = TTL is C^g L \ the total cost of 
delayed delivery phase (last line of the expression in Lemma[3]> 
follows easily. □ 


IV. Applications: Cost Optimization 

In a real scenario, the network operator would have to 
offload simultaneusly many different contents. Using the re¬ 
sults of the previous section, the average performance or the 
total cost over all the contents can be calculated easily, by 
evaluating them for each content separately and then averaging 
or summing them, respectively. However, since some of the 
system parameters are controlled by the operator (e.g. Hq), 
they can be selected such that they lead to optimal perfor¬ 
mance. To this end, in this section, as an application of our 
analytical results, we study how offloading and caching can 
be designed in order to minimize the total cost. 


#Rtot = Ro - R(t) 

























A. Optimizing the Total Offloading Cost 

Let us assume that the content provider has to deliver M > 
1 contents to their requesters. We denote the set of the contents 
as Ai (M = \Ad\). Since in a real scenario, not all contents 
are expected to be equally popular Q-m, nor tolerate equal 
delays, we denote the popularity (i.e. the number of initial 
requesters) and the expiry time of each content 6 G Ai as R 8 
and TTL 8 , respectively. 

Under a given setting (i.e. with certain mobility, cooper¬ 
ation, traffic, etc., characteristics), what the cellular network 
can select, is the initial placement ( caching ) for each content 
9 G Ai', namely, the number of SC and MN initial hold¬ 
ers, H 8 C ( 0) and H 8 MN ( 0), respectively (note that H 8 (0) = 
H e sc (0) + H 8 mn (0)). 

Therefore, if we denote as C 8 is the delivery cost of a 
content 9 G Ai (which is given by Result [5]), we can express 
the total cost optimization problem as 

Problem 1. 

min ffsc, ~Hmn {LeeM^} 

s.t. V9 gA4: 0 < H 8 sc ( 0) < N sc 
0 < H 8 MN ( 0) < R e ( 0) 

and E H sc( o) < E Q® 

9GM i&SC 

where Hsc an d Hmn denote the vectors with components 
Hg C ( 0) an d H%n (0) (9 G Ai), respectively, and Q(i) is the 
caching capacity (in number of contents) of a SC *• 

The costs C 8 in the objective function of Problem [7] can 
be expressed as a function of the optimization variables 
(Result 0. As a result, well known numerical methods can 
be employed to solve Problem [7] In the following numerical 
example, we calculate the optimal content placement by solv¬ 
ing ProblemQ] apply the “offloading on the edge” mechanism, 
and demonstrate how it can reduce the total content delivery 
cost. 

Numerical Example: We assume a network with Nsc = 4 
SCs, each of them having a storage capacity of Q = 100 
contents. Edge nodes meet each other with an average meeting 
rate p\ = 3.3 ■ 10 -5 meetings/sec (equal to this of the 
real mobility trace li25l ). The cellular network has to deliver 
M = 100 contents of equal size and expiry time TTL. Content 
popularity is given by a bounded Pareto distribution in the 
interval Rq G [10,1000] with shape parameter a = 0.5 J7j or 
a = 1. The (normalized) costs arcfl 

Cbh = 0.8 C sc = 0.2 cfs L) = 2 

CbS = 1 Cd2D = 0.1 

7 Since MNs cache only contents in which they are interested in, we assume 
that then* storage capacity is enough for all the contents of interest. Hence, 
storage capacity constraints for MNs are not considered in Problem [I] 

8 In general, the offloading costs incurred in each phase, might differ 
between areas, time periods and operators. Their absolute values are not 
available and/or are difficult to estimate. To this end, in this example, as 
well as in other numerical results, we use relative values inferred by some 
average values proposed in ED. 



(a) p c = 0.1 (b) TTL = 5 min. 


Fig. 2: Relative Cost Decrease c ~ < f ,° ss , where C and C 0 ff 
denote the total cost of delivering the contents without and 
with offloading, respectively. 


Fig. m shows how the total content delivery cost decreases 
when the cellular network offloads contents on the edge nodes. 
Specifically, we present the Relative Cost Decrease, RCD, 
~ c ° sl , where C and C 0 // denote the total cost of delivering 
the contents without and with offloading, respectively. High 
values of the RCD correspond to cases where the cost of 
“offloading on the edge” C 0 f / is small (compared to the cost 
without offloading C ), whereas low values of the RCD indicate 
that the gain due to offloading is not significant. 

In Fig. |2(a)| we present how RCD changes for different 
values of delay tolerance, TTL, when the percentage of MNs 
contributing to offloading is 10% ( p c = 0.1). In Fig. |2(b)| 
we present the change of RCD for different MN cooperation 
levels ( p c ) and for a delay tolerance equal to TTL = 5mm. 
The two curves, in both figures, correspond to scenarios where 
content popularity is Pareto distributed with exponent a = 0.5 
and a = 1. Some important observations are the following: 
(i) Higher values p c or TTL lead to lower total offloading 
costs, i.e. the more willing an MN is to offload contents or to 
tolerate delays, the more effective the “offloading on the edge” 
becomes. In particular, for the presented scenarios, the content 
delivery cost can decrease up to 80% for large TTL values 
(Fig. |2(aj] i, while for large p c values the offloading cost can be 
40% less than the cost without offloading (Fig. |2(b)| i. (ii) The 
cost gains are larger for smaller Pareto exponents a. This is 
due to the fact that (a) for the more skewed distributions, i.e. 
a = 1, there exist more unpopular contents than in the case 
of a = 0.5, and (b) “offloading on the edge” (and, in general, 
offloading mechanisms) is more efficient when distributing 
popular contents. 


B. Case Studies: Optimal Content Placement 

Under certain scenarios, analytical solutions for Problem |T| 
can be found as well. In the remainder, we focus on two char¬ 
acteristic cases, which are analytically solvable, and provide 
useful insights for the system. 


Offloading through SCs 

We first consider the case where contents are offloaded only 
through SCs (i.e. when p c = 0 and H 8 MN (<S) = 0, or equiva¬ 
lently, H 8 = Hg C (0)). This is the most common and feasible 




























scenario considered in previous literature, since MNs are not 
required to share their contents, and thus incentive mechanisms 
are easier to implement. In this case and foi[] Csc < CZ TL) 
it can be proved that Problem Q] is convex and we compute 
the analytical solution in Result [4] For notation simplicity, we 
consider equal expiry times TTL 0 = TTL, \/6 £ M., and 
cache sizes Q(i) = Q, Vi £ SC. However, Result [4] can be 
easily modified for different TTL 0 and Q(i) values. 


Result 4. Under a base scenario (p c = 0, Hmn{ 0) = 0), the 
initial allocation Hgc that minimizes the total cost, is given 
by 


H 6 SC ( 0 ) 


N SC 

i-ln(i.f?») 

0 


,R e { 0) > U 
, L < R e { 0) < U 
,R 0 { 0) < L 


with 7 = p\ ■ TTL, L = ^ 


$ = 


MTTL ) n 
( B S _ CSC 

Cbh ’ 


( 1+ U = L ■ e J ' Nsc , 


and 


A 0 = inf A 0 > 0 : ^ H e sc ( 0) < £ Q(i) 

{ 8<ZM iesc 

Proof. Applying the method of Lagrange multipliers |26l 
to Problem [I] gives (for brevity we use the notation 
H 0 = H 0 C ( 0+) = H 0 C ( 0) and R 0 = R 0 { 0+) = R 0 ( 0)): 


v f E c*) = VA ° (E qw - E H o) 

\8eM ) \iesc 8eM J 

+ V Y, Ae-Ztf + V Y, Pe-( N sc-H 0 ) (15) 

9(zm eeM 

where Ao > 0 and Xg, pg > 0 , \/8 £ M. are the langrangian 
multipliers. 

Using the expression of ResultUfor the delivery probability, 
the offloading cost (Result [3J of a content 6, in a base scenario, 
can be written as 


C 0 = C bh -H 0 +Csc-KHC ( bI L) -Csc)-RU-^<- ttl 

(16) 

Substituting C e from Eq. (fl6l > to Eq. ( 1 1 51 . the differentiation 
over H 0 gives 


HS = ~ 


1 

7 


In ($ • 7 • R 0 ) - In ( 1 + 


Ao — Ag + fig 

Cbh 


(17) 


The conditions for the lagrangian multipliers, i.e. 


X e -H 0 = O, and p e ■ {N sc ~ K) = 0 £ M 


imply that H 0 either 

(a) is given by Eq. O and \q = fie = 0 , or 

(b) is equal to Nsc and \q = 0 , fie > 0 , or 


9 The “offloading on the edge” mechanism is meaningful if Csc < 
C^g L \ as in the opposite case, offloading would cost more than directly 
delivering from the macro-cell BSs. 

10 In particular, one has to substitute 7 with 7 & = fi\ ■ TTL e for 
each content. The expressions for Hg C ( 0) remain the same, and only the 
expressions of L and U need to be modified. 


(c) is equal to 0 and \q > 0 , fie = 0 
From condition (a), we calculate the limits of the interval 
within which the optimal Hq is given by Eq. dm . To find the 
lower limit, L, we set H 0 (Eq. (fl7l) with A e = ne = 0) equal 
to 0 and for the upper limit, U, equal to Nsc, which give 
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(18a) 


U = —• e' 1 ' Nsc ■ (1 + -E ) = L ■ e 7 ^ 30 (18b) 

7 ■ $ V Cbh ) 

Combining Eq. (HU and Eqs. we can express the 

optimal placement as 
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< L 


The only unknown parameter in Eq. ( fl9l > is Ao (since we 
expressed L and U as functions of Ao). Lemma [2] which we 
state and prove in Appendix [Bj suggests that the total cost, 
ffi/.-M C e , is monotonically increasing with Ao- Therefore, 
the optimal placement policy corresponds to the smaller non¬ 
negative value of Ao that satisfies the storage constraint, 
HZeeM H 6 o < J2iesc Q(*)> an£ l this proves the Result. □ 


In general, the value of the parameter Ao can be found 
(within some precision) with e.g. a binary search. Neverthe¬ 
less, for a large number of contents, and given their popularity 
distribution, its value can be directly calculated using the 
Corollary [T] which follows after substituting the expression 
of Result Q] and the popularity density function in the storage 
constraint H sc(° ) = E,; e sc <?(*)■ 

Corollary 1. Under a content popularity distribution p(x), 
the parameter Ao in Result @ is given by Ao = max|o, Ao|, 
where Ao is the (minimum) solution of 


j In (7 ■ $ • x) ■ p(x)dx — In ^1 + ^ ■ J p(x)dx 


+ 7 ' N sc ■ 


p{x)dx = 


7 • N S c ■ Q 
M 


Result [4] reveals how resources should be allocated: (i) The 
optimal allocation is logarithmic in popularity, with either 
large or small caches, (ii) When capacity is limited, an extra 
factor (Ao) is introduced, so that the relative allocation remains 
logarithmic, but the absolute allocation is reduced (normal¬ 
ized) as the number of contents increase, or total capacity 
decreases, (iii) Some low popularity contents might get no 
allocation, either because it does not help the offloading cost, 
or because there is not enough capacity for them. 

Practical Example: Assume an urban area covered by 
Nbs = 4 macro-cell BSs and Nsc = 100 SCs. On average, 
in this area reside Nmn = 10000 userf^ with an average 


u Vodafone Germany reported an average number of 1700 users per cell 
(http://mobilesociety.typepad.com/mobile_life/2009/06/base-station-numbers.html 1 . 
In an urban environment, users density is expected to be higher. 














meeting rate ji\ = 3.3 • 10 -5 meetings/sec |25| . The cellular 
network has to deliver M contents (e.g. YouTube video files 
of an average size 10 MB 0) with expiry time TTL « 5 min 
and popularity given by a bounded Pareto distribution in the 
interval Rq £ [10,1000] with shape parameter a = 0.5 f7|. 
The costs are = 10 -Cbh and Csc *C Cbh , C^g L \ 

Substituting the given values, and taking the expectation 
over the popularity distribution, it follows that the necessary 
buffer size of a SC, Q = -M-L, is approximately 1 MB 

per content. This means that, even under very high traffic 
demand, the caching capacity of the SCs would be adequate 
such that the last constraint of Problem Q] is not violated; e.g. 
for M = 100000 (i.e. each user requests 10 videos per 5 
minutes!), the needed capacity is Q = 100G73 (which is a 
feasible and relatively cheap investment). 


Offloading through MNs 

We now consider the case where offloading takes place only 
through MN-MN communication ( p c > 0) and without content 
storing on SCs (i.e. fTsc(0) = 0). A content is initially sent 
by the BSs to Hmn(0 ) (out of R( 0)) of its requesters, which 
start disseminating it to the other requesters. However, not all 
nodes might be willing to participate by acting as holders, 
which in our framework means that each node (including the 
initial nodes in which the content is placed) cooperates with 
probability p c . Therefore, we can write 

H 0 = H M n{ 0 + ) = Pc ■ H M n{ 0) 

Also, as defined in Lemma [Q 

R 0 = R(0 + ) = R(0) - H mn (0) 


As in the previous case, we assume equal expiry times 
TTL 6 = TTL, M8 £ M. 


Result 5. Under an opportunistic MN-MN scenario (p c > 0, 
Hmn{ 0) — 0 ), the initial allocation Hmn that minimizes the 
total cost, is given by 
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where OPT 6 = --- HoTTZ. -—, and 
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* = H,-cZ °" d 1 = ^-TTL- 


Proof. The cost for offloading a content 9 under an oppor¬ 
tunistic MN-MN scenario, where H 6 = p c ■ H 6 MN (ff) and 
R 6 = R(0) s - H 6 mn { 0), is (see Result© 


C 6 = C BS ■ H 6 mn { 0) 

+ ( C D2D - cg TL) ) ■ (R e ( 0 ) - H 6 mn { 0 )) • P{T d < TTL } 
+ C^ L) -{R 6 (f))-H 6 MN m (20) 


Similarly, for H 6 = p c -H e MN { 0) and R 6 = R 6 (0)- H e MN (0), 
the delivery probability P{T d < TTL} can be written as 


P{T d < TTL} = 1 


it s ( 0 ) ('91t 


where 7 = ■ TTL. 

Substituting Eq. (EH in Eq. ( EOl l. and taking the derivative 
over the initial number of transmissions H 6 MN {(f), gives 


dC 6 

dH 6 MN (ff) 


(r {TTL) y-i \ 
V^BS ~ ^D2D) 


| ( C D2D - C BS ) • (-R e (0 )) 2 • 

(■ R 6 (0) + H 6 MN (0)-(e'rPc-R'>(0)_ 1) f 

From Eq. ( EH it follows that 


( 22 ) 


dC 6 f <0 ,H 6 in ( 0) <OPT 6 
dH% N { 0) ~ [ > 0 ,H 6 mn ( 0) > OPT 6 


where 


OPT 6 


R 6 ( 0) • e5T'-Pc-fl fl (°) _ x^ 

e i-Pc-R e O) _ x 


(23) 


Therefore, when OPT 6 £ [0,ii e (0)], the minimum cost is 
achieved for H 6 MN ( 0) = OPT 6 . Otherwise, for OPT 6 (f 
[0,f? e (0)], and since it must hold that H 6 /IN (0) £ [0,f? e (0)], 
the minimum cost is achieved for the largest or lowest possible 
values of H 6 MN {f)). □ 


Result © reveals how content storage should be delivered 
when offloading only through MNs is considered. As it can 
be seen, the initial allocation is much different that in the 
offloading through SCs case (see Result©, and this is mainly 
due to the fact that some of the requesters get the content at 
the beginning. 


V. Simulation Results 

To validate our analysis, we compare the theoretical predic¬ 
tions against Monte Carlo simulations (Section IV- Al l. Then, 
we evaluate the cost efficiency of ’’offloading on the edge” in 
scenarios with realistic traffic demand patterns (Section [V-Bb . 

A. Model Validation 

1) Synthetic Scenarios: We first compare the theoretical 
results against Monte Carlo simulations on various synthetic 
scenarios. Synthetic simulations allow us to create a number 
of different scenarios with varying parameters. 

We generate synthetic networks, conforming to the model 
of Section III-DI as following: 

(i) We choose a probability distribution f\ (A) and for each 
pair {i,j} we draw randomly a meeting rate A ij. 

(ii) We create a sequence of contact events for every pair in 
the network with rate (Poisson processes with rates A ij). 

(iii) We select the content traffic parameters (Rq, Hq, p c , 
Hsc{ 0 ), Hmn{ 0), Ngc ), an d we simulate a large number 
of content disseminations, choosing randomly each time the 
set of requesters and the set of holders (note, however, that 
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(a) H 0 = 1. p c = 0.5 



Fig. 3: (a) Expected number of holders, H(t), and requesters, 
R(t), over time for generic scenarios with Rq = 100, 
Hsc = 0 ; (b) shows the corresponding results for the delivery 
probability, i.e. P{Td < TTL}, where TTL is the x-axis 
variable. 


the set of holders depends also on the parameters Hsc{ 0), 
Hmn{ 0) and Nsc)- 

We have created many scenarios with different combinations 
of mobility (/a (A)) and traffic (R 0 , Hq, p c , H S c{ 0), H M n( 0), 
Nsc ) characteristics. We present here a representative subset 
of them, which allow us demonstrate the accuracy of our pre¬ 
dictions and their sensitivity when varying certain parameters. 
In the presented scenarios we create nodes mobility according 
to a gamma distribution f\ (A) with mean value p,\ = 1 (i.e. 
normalized value) and variance cr' x (or, equivalently, coefficient 
of variation CV\ = (27l . Gamma distributions allow us to 

capture different levels of mobility heterogeneity by varying 
the value of CV\. 

Content Dissemination. In Fig. [3] we compare simulation 
results (average values over the different runs) of expected 
number of holders ( H(t )) / requesters ( R(t )) and content 
delivery probability P{Td < TTL} with the respective 
theoretical predictions (Lemma Q] and Result 0] respectively). 
Considering the same content traffic parameters, we simulated 
scenarios with moderate (CV\ = 1) and high (CV\ = 2) 
mobility variance, in order to show how mobility heterogeneity 
affects the accuracy of our predictions. It can be seen that 
our predictions become more accurate for lower mobility 
heterogeneity (CV\ = 1). This is due to the mean field 
approximation of the transitions rates we used in the analysis 
(see Section 1III-A1) . For scenarios with even lower mobility 
heterogeneity (e.g. CV\ = 0.5 - not shown in the plots) the 
accuracy is even better. Additionally, we need to highlight 
that these results correspond to an initial allocation of only 
one holder (// 0 = 1 ), which is the worst case scenario (i.e. 
lowest accuracy of the mean field approximation, and, thus 
our predictions) among the ones with the given mobility and 
traffic (other than Hq) characteristics. In the same scenarios, 
when considering a few more initial holders, e.g. Hq = 10 , 
theoretical results achieve an almost exact prediction. 

Similar observations can be made in Fig. 0] where we 
compare the theoretically predicted delivery delays with the 
respective simulation results. The results in Fig. 0] are in 
accordance with the above observations, i.e. the predictions’ 
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(b) p c = 1 


Fig. 4: Expected delivery delay, E[Td\, for various generic 
scenarios with Rq = 100, Hsc = 0 and (a) p c = 0.5, (b) 
Pc = 1- 


accuracy increases for (a) lower CV\, and (b) higher number 
of initial holders Hq. 

Offloading Cost. We finally present results that validate 
the cost optimization analysis of Section Hvl Fig. 0] shows the 
incurred cost for the cellular network (y-axis) under different 
number of initial holders Hq (x-axis) for various generic traffic 
scenarios. Different cooperation policies (top plots: p c = 1, 
middle plots: p c = 0.5, and bottom plots: p c = 0) and expiry 
times TTL (or, equivalently, 7 = ■ TTL) are considered. 

It can be seen that our results accurately predict the content 
dissemination cost. 

Some remarkable observations about the optimal initial 
allocation of holders that can be made in Fig. 0] (as well as in 
other scenarios we investigated) are the following: (i) In many 
cases, offloading on the edge can significantly reduce the cost 
of a content dissemination. For instance, in the scenario shown 
in Fig. 0] (bottom plot - bottom curve / black color), even 
without node cooperation ( p c = 0 ), offloading on the edge 
can reduce the cost 10 times, compared to the corresponding 
scenario without offloading (i.e. C = 100). (ii) An optimal 
initial allocation requires only a small number of (initial) 
storage resources, which in most of the cases we present is 
equal or less than 20% of the content requesters, (iii) The 
higher the allowed delay (i.e. expiry time TTL or parameter 
7 ) is, the larger the gain the cellular network can have is. For 
example, consider the red line (7 = 0.05) in the bottom plot. 
Increasing xlO the value of TTL (black line - 7 = 0.5) can 
reduce the cost (e.g. for Hq = 5 which is close to the optimal 
allocation) almost 8 times. 

2) Mobility Traces: Results of synthetic simulations 
demonstrate a significant accuracy of our predictions and 
verify the arguments used in the derivation of our results. In 
this section, we present results in more challenging scenarios, 
where node mobility characteristics depart from our model 
assumptions. 

Specifically, we use the TVCM (2D and SLAW (29) 
mobility models, which have been shown to capture well 
real mobility patterns, like power-law flights (29l , community 
structure |28l , etc. The generated scenarios we present are 
TVCM scenario: Mobile nodes move in a square area 
1000m x 1000m, which contains three areas of interest (com- 










































Fig. 5: Single content offloading cost C (Lemma O under 
different number of initial holders ( Hq , x-axis) for a synthetic 
mobility scenario with Rq = 100 , Hsc = Hq, and Cbh = 
C), s ; = 50 ■ Csc- Dashed lines correspond to theoretical 
predictions and markers to simulation results. We denote 7 = 
Ma • TTL. 

munities). Nodes move mainly inside their community (60% 
of the time) and leave it for a few short periods. Macro-cell 
BSs provide full coverage of the whole area, while 25 non¬ 
overlapping (placed on a grid) small-cell base stations (SCs), 
with a communication range of 100 m, provide further connec¬ 
tivity. Mobile nodes are equipped with D2D communication 
interfaces, for which we assume a range of 30m. 

SLAW scenario : A square area of edge length 2000m is 
simulated, where mobile nodes either move or remain static for 
a maximum time of 20 min (the other mobility parameters are 
set as in the source code provided by |29l ). Macro-cell BSs 
cover the whole area and coexist with 100 non-overlapping 
small-cells. Communication ranges are set as above. 

In Fig. [ 6 ] we present the delivery probability P{Td < 
TTL}, along with the theoretical prediction, for two con¬ 
tent traffic scenarios in the TVCM (Fig. | 6 (a)[ ) and SLAW 
(Fig. | 6 (b)| > traces. Contents with popularity /?(()) = 50 are 
initially cached to H( 0) edge nodes (half of which are MNs). 
The MNs’ participation in offloading is set to p c = 0.5. In the 
TVCM trace (Fig. | 6 (a)) it can be seen that the accuracy of our 
results is significant, despite the community structure of the 
network (which cannot be captured explicitly by our mobility 
Assumption 0. In the SLAW scenario (Fig. | 6 (b)| >, our results 
overestimate the delivery probability. However, note here that 
the number of holders in the SLAW scenario is smaller, 
and, thus, our approximation is expected to be less accurate. 
For scenarios with more initial holders the accuracy of the 
predictions increase (see e.g. Fig. |7(b)| where the accuracy is 
higher for higher Hq values). 

Although in some points the theoretical performance metrics 
deviate considerably from simulations (e.g. 20 %), the accuracy 
of the cost metrics (Lemma[3]l is less affected. Fig.[7]shows the 
incurred cost for delivering a content to /((()) = 30 requesters 
(y-axis) under different number of initial holders Hq (x-axis). 
Different initial placement policies (-ffsc(O), -Hmjv(O)), levels 
of MNs participation ( p c ), and expiry times TTL are consid¬ 
ered. In the majority of scenarios our results accurately predict 
the offloading cost. Yet, even in the case where the predictions 
are less accurate (e.g. in Fig. |7 (b)| for p\ ■ TTL = 0.05), they 



Fig. 6: Delivery probability P{Td < TTL} over time TTL 
(x-axis), for the (a) TVCM and (b) SLAW scenarios with p c = 
0.5 and H sc {0 ) = H MN { 0) = 



(a) TVCM (H SC { 0 ) = H M jv(0)) (b) SLAW (H sc ( 0) = 0, ) 


Fig. 7: Offloading cost (y-axis) vs number of initial holders 
(Hq, x-axis). Dashed lines correspond to theoretical predic¬ 
tions and markers to simulation results. Transmission costs are: 
( a ) C^ L) ~ 10 • Cbh = 10 • Cbs = 20 • Csc = 20 • Cd 2 d 
(top plot) and C^g L ' > = Cbh = Cbs = 10 • Csc (bottom 
plot); (b) C ( bs L ’ = 2 • Cbs = 10 • C D2 d- 

can still capture the actual optimal initial allocation regimes. 
B. Performance Evaluation 

After validating our analysis, we now investigate the cost 
efficiency of the ’’offloading on the edge” mechanism in a 
realistic traffic scenario. We present results that demonstrate 
the effect of different system factors, and provide useful 
conclusions for cellular network operators. 

The parameters of the scenario we consider are the following: 

— Popularity: Content popularity has been shown to follow 
a power-law distribution Q- 0 . Thus, we draw the pop¬ 
ularity of each content from a bounded-Pareto distribution 
(Rq £ [1,100]) with shape parameter a = 0.5 J7). 

— Traffic Intensity: Mobile operators do not release real 
mobile traffic data. To this end, and since traffic demand is 
directly related to the number of mobile users that reside in 
an area, we infer traffic patterns from an available dataset 
of the Gowalla location-based social network. The Gowalla 
dataset ff30l contains information (logs of position and time) 
of user checkins (through their mobile devices) in different 
venues. In the scenarios we present, we create different number 
of contents during a 24 h time interval. The number of contents 
M is proportional to the number of mobile users that checked- 
in a certain area (we selected the most popular venue) at 






















































the same time. The maximum number of concurrent contents 
is M = 200. 

— Delay Tolerance: We set equal expiry times TTL for 
each content, and we consider different sets of scenarios with 
low (TTL = 5 min), moderate (TTL = 25 min), and high 
(TTL = 25 min) delay tolerance. 

— Costs: The relative costs are set Cbs = C'b's'^ = 
10 • Cbh = 20 • Csc = 20 • Cd 2 D, values selected based 
on some data presented in GO. 

— Node Mobility: We use the TVCM mobility scenario 
presented in the previous section. 

Offloading through SCs 

We first consider the case of offloading through SCs. We 
simulate two sets of scenarios with small (Q = 5) or large 
(Q = 200) caches. We choose the optimal initial caching 
policy of Result [4] 

In Fig. [ 8 ] we present the total offloading cost (marked lines) 
incurred for the cellular network operator over different times 
of the day. The gray area shows the intensity of mobile 
users that reside in the considered area. The dashed line 
denotes traffic demand over time, or equivalently, the cost 
when content delivery without offloading is considered. 

Some interesting observations that follow from Fig. [ 8 ] are: 

(i) Under the optimal caching policy, ’’offloading on the edge” 
can significantly reduce the cost of content delivery, up to an 
order of magnitude, or even more in some cases. 

(ii) The ’’offloading on the edge” cost changes over time 
much smoother than traffic demand. In particular, for large 
caches (cross/red line), the offloading cost curve is almost 
flat, despite the large peaks in traffic demand. In cellular 
networks, such temporal variations of the traffic intensity is an 
important issue, since operators are required to over-provision 
the network capacity (high CAPEX costs) im As we show, 
’’offloading on the edge” can amortize these costs. Even under 
higher transmission costs Cbh, Csc than these we assumed, 
although the operating cost (OPEX) increases, the cost curve 
remains smooth, reducing thus a need for over-provisioning. 

(iii) Large caching capacity has as a result a smoother cost 
curve (cross/red vs circle/blue curves). This is a positive 
message for operators, because to equip SCs with large enough 
caches is both feasible and inexpensive, as discussed in the 
example scenario of Section HV-AI 

(iv) Comparing Fig. | 8 (a)| and Fig. | 8 (b)| we see that the 
tolerated delay has also a significant effect on the smoothness 
of the cost curve (higher TTL values lead to smaller vari¬ 
ations). This implies that an alternative way of avoiding the 
over-provision cost (CAPEX), is to give incentives (OPEX) 
to users for accepting delayed content. Such solutions have 
been previously considered, e.g. EES, however, our framework 
allows an easy investigation of their effects (due to the closed- 
form results) and an analytic approach of pricing policies, etc. 

Offloading through MNs 

Now, we evaluate the performance of offloading through 
MNs. We simulate scenarios with different levels of node co- 



(a) TTL = 25 min (b) TTL = 5 min 


Fig. 8: Traffic demand and offloading cost over a 24 h period. 



(a) Offloading Cost vs. Time (b) Offloading Cost vs. p c 

Fig. 9: (a) Traffic demand and offloading cost over a 24 h 
period. User cooperation is 10%. (b) Total offloading cost over 
a 24ft period, normalized to the total cost without offloading. 


operation p c . We choose the optimal initial content placement 
policy of Result 0 

In Fig. |9(a)| we present the total offloading cost (marked 
lines) incurred for the cellular network operator over different 
times of the day. We simulate three scenarios with low, 
moderate and high delay tolerance (TTL = 5,25,60 min), and 
10% of user cooperation in offloading (p c = 0.1). Similarly 
to the offloading through SCs case (see e.g. Fig. [8]». for 
higher TTL values, the cost decreases and its variations are 
smoother. However, it can be seen that improvement between 
the scenarios with TTL = 25 min and TTL = 60 min is not 
significant. This has an important implication for the system: 
Although increasing the delay tolerance is beneficial for the 
operator, after a point or gradually (depending on the scenario), 
the effects of this improvement become negligible. Bearing in 
mind that user satisfaction decreases with TTL indicates that 
there is a tradeoff, which should be carefully assessed by the 
system designer or considered for further optimization. 

In Fig. |9(a)| we show how the total offloading cost over a day 
period (normalized to the respective cost without offloading) 
changes with p c . It is evident that varying user cooperation 
does not have the same effects for different scenarios, and 
that the minimum total cost is achieved at different values of 
p c . This introduces one extra dimension, which can be used for 
system optimization as well. Such optimization options (with 
respect to TTL, p c , etc.) could lead to interesting conclusions, 
we intend to consider them in future research. 
































VI. Discussion and Extensions 

In this section we discuss some important issues related 
to assumptions made implicitly or explicitly throughout our 
analysis and how they can be extended or removed to make 
our framework more generic. 

A. Heterogeneous Mobility 

The mobility model we use allows heterogeneous meeting 
rates A, 7 (Assumption [2]) in order to account for various node 
mobility patterns and communication ranges. However, apply¬ 
ing the approximation of Eq. (0 (proof of Lemma ITlF^l . leads 
to considering only the mean value of the meeting rates fix 
in the analytical results. Although the same expressions could 
have been derived (easier) by using a homogeneous model, 
i.e. \/i,j : Aij = p our main motivation for considering 
heterogeneous rates is the following: We can easily incorporate 
further heterogeneous social characteristics (related to mobility 
patterns), like (i) social selfishness (where each node coopera¬ 
tion is related to their social ties) ED, or (ii) smart, mobility- 
aware content placement algorithms (where ’’better” holders 
are selected) (32). These characteristics could not have been 
taken into account under a homogeneous mobility assumption. 

To extend our results for the two above cases, it is just 
needed to modify the analytic expressions by substituting the 
average meeting rate px with the effective average meeting 
rate p^ 1 ^ (see (3l] Lemmas 3.1 and 3.2] and (32] Result 4], 
respectively), which is given by 

=E[\-p(\)\ and p^ ff) = (24) 

where the function p( A) describes the social selfishness and 
7t(A) the mobility-aware content placement algorithm. The 
expectations in Eq. (124) are taken over the meeting rates 
distribution f\, and, as a result, the mobility heterogeneity is 
actively involved in the performance prediction expressions. 

B. Cost Model 

In our model we considered constant costs (for the different 
transmission types). However, in different scenarios, there 
might exist some correlation between transmission costs and 
other system parameters. Some examples could be: (i) The 
delayed content delivery cost C'llJ L] might be a function of 
TTL (e.g. increasing with TTL)\ (ii) If multicasting is used 
for initial content placement to MNs, the transmission cost 
Cbs might not be linearly related to Hmn(0 ) (i.e. the cost 
of multicasting a content to ITmjv(O) nodes, might not be 
equal to Hmn(0 ) times the cost of a unicast transmission); 
(iii) The MN-MN transmission cost Cd2D might be related to 
the cooperation probability p c , e.g. the willingness of nodes 
to participate in offloading (which is captured by p c ) might be 
higher when the reward for each offloaded content (which is 
captured by Cd 2 D ) increases. 

12 In Eq. {5} the meeting rates of a sum are approximated with their mean 
value fi\. As discussed earlier, this approximation becomes more accurate 
when the heterogeneity of the meeting rates (i.e. the variance of f\) decreases. 


Our results predicting the content dissemination perfor¬ 
mance and cost (Section HTTb hold also when adopting such 
more generic cost models. What changes is the cost op¬ 
timization problem (Problem [TJ, which might need to be 
reformulated. However, even under the above examples (as 
well as a number of other cost models), Problem Q] can be 
still expressed in a closed form expression and thus be solved 
with known (analytic or numerical) methods. 

Summarizing, since till now there is no common technology 
or cost policies applied in offloading systems, we prefered to 
assume a simple cost model with constant costs. Nevertheless, 
as discussed above, our results can be extended for other cases 
as well. 

C. Content Dissemination 

The assumptions for the content dissemination (e.g. non¬ 
increasing number of requesters, only initial placement to SCs, 
etc.) can be extended and, thus, more generic scenarios can be 
captured and analysed in a similar manner. In the remainder, 
we demonstrate how our model and analysis is extended to 
include two extra characteristics, namely (i) content discards 
and (ii) bulk arrivals/departures of requesters. 

First, we extend to cases where contents can be discarded 
by edge nodes before their expiry time (TTL). For example, 
a MN whose battery level decreases, might stop storing and 
forwarding contents in order to save energy. As a second 
example, when a MN’s cache is full and a new content is 
received, the MN needs to drop either one of the stored 
contents or the one just received. On the other hand, the state 
of the SCs’ caches will be probably known by the cellular 
network and, thus, such cases can be avoided or controlled. 
To be able to analyze such content discards with a Markovian 
framework, we use the following model 

Assumption 3 (Content Dropping). A MN drops a cached 
content with rate A d > 0. The content dropping process is 
Poisson and A d is equal among all holders and all contents. 

Under the above assumption, the ODE of Eq. © can be 
turned into 

= p c ■ H(t) ■ R{t) ■ px - ( H(t) - H sc m ■ A d (25) 

where the last term corresponds to the content discards by the 
MN-holders. 

Second, we assume that at times {ti, T 2 ,...} either a number 
of new MNs get interested in a content (i.e. new requesters 
enter the system) or some of the existing requesters lose 
their interest in the content (i.e. they leave the system). We 
denote the numbers of such arriving/departing requesters at 
times {ti, T 2 ,...} as {R T1 , R T2 , • ■•}, where a value R T can be 
positive (denoting arrivals) or negative (denoting departures). 
As a result, the ODE for the number of requesters R(t) over 
time (see Eq. ©) is given now by 

= -H(t) ■ R(t) ■ px + J2 R r-S(t- t) (26) 

r 

where <5(-) is the Dirac delta function. 





Finally, solving the system of the ODEs of Eq. ( |25| > and 
Eq. (l26l t. gives the solution for the deterministic approxima¬ 
tions for H(t) and R(t) for a generic scenario with content 
discards and bulk arrivals/departures of requesters. 

VII. Related Work 

In this section we discuss works that are closer to ours, 
rather than studies which do not consider caching and/or 
delay tolerant delivery, and which are mainly based on pure 
infrastructure architectures, e.g. with WiFi access points S) or 
small-cell base stations J2), 0, or on the D2D paradigm ITOl . 

Mobile data offloading through opportunistic communica¬ 
tions and epidemic content dissemination is studied in US, 
El, [fl9l . f20l . In the setting of fl9l . copies of a content 
are distributed through the infrastructure to a subset of mobile 
nodes, which then start propagating them epidemically. The 
performance of different content “pushing” techniques (e.g. 
slow/fast start) is investigated through simulations on a real 
vehicular mobility trace. Analytical approaches for pushing 
techniques can be found in |[T3 - 1 14-1 . which study the optimal 
selection of the number of initial and final content pushes. M 
models the content dissemination as a control system and pro¬ 
poses an adaptive algorithm, HYPE , which aims to minimize 
the load of the cellular network by using real time measure¬ 
ments. On the other hand, E3 uses a fluid limit approximation 
and focuses on the cost optimization problem. Finally, ll20l 
takes into account fairness among different contents/nodes, and 
derives schedulers that maximize the throughput, under given 
mobility and wireless channel conditions. These studies, in 
contrast to our framework, assume that every user is willing to 
offload contents, even if they are not of her interest. Difficulties 
in devising incentive mechanisms or limitations of device 
capabilities, might render such settings unrealistic. 

To this end, m, m consider a limited number of 
(designated) holders. fl2ll proposes centralized algorithms for 
selecting the best set of available holders, in order to minimize 
the traffic load served by the infrastructure. In a different 
approach, 1321 focuses on the effects of popularity (number of 
requesters) and availability (number of holders) on the perfor¬ 
mance of content delivery. Our paper extends these works, by 
introducing generic offloading costs and policies, and deriving 
insightful, closed-form results for the optimal caching. 

Finally, (331 proposes caching in femto-cells and user 
devices, in a different setting than ours, where users communi¬ 
cate with several holders simultaneously. D2D communication 
is controlled by a macro-cell BS, which is aware of the status 
of caches, location of users, and channel state information 
between them. The objective of the paper is to decide which 
files should be stored and on which helper node, a problem that 
is shown to be NP-hard. This problem is formally presented, 
studied in more detail, and extended for coded contents in |5j|. 

VIII. Conclusion 

In this work we studied “offloading on the edge”, a mech¬ 
anism that employs edge nodes (SCs and/or MNs) to oppor¬ 
tunistically offload popular content. We built a model that can 


capture heterogeneous traffic demand, user cooperation and 
mobility characteristics, and describe generic caching and of¬ 
floading policies. Based on our model, we derived closed-form 
expressions for predicting the offloading performance. These 
allowed us to analytically study the cost optimization problem, 
and provide results that shed light on how caching policies 
should be designed. Realistic simulations verified the insights 
that stem from our analysis, and led to useful conclusions. 

Our closed-form expressions reveal how and to what extent 
each system parameter affects performance and cost. Thus, 
they could be easily applied to sensitivity analysis, network 
planning and dimensioning, or design of pricing strategies; 
issues that have recently attracted a lot of attention from 
network operators, who seek novel solutions to alleviate the 
effects of the rapidly growing traffic demand. 
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Appendix 


A. Proof of Result [2] 

Proof The probability a content to be delivered in the time 
interval [t, t + dt ) is given by 

P{Td e[ft + dt)} = dP{T * t ~ t} ■ dt (27) 

Since a requester gets the content at time t = TTL from 
a BS, if it has not received it earlier, we can write for the 
expected delay 


E[Ti\TTL\ = TTL • (1 - P{T d < TTL}) 

i-TTL 

+ / t ■ P{T d £ [t, t + dt)} 

= TTL ■ (1 —P{T d < TTL}) + f t- dP ^ ~ 0 -dt 

Jo dt 

(28) 

where the last equality follows from Eq. (l27l >. 

Using the expression of Result Q] we first compute the 
derivative , and, then, the integral in Eq. (l28l >. and 


we get 


E[Ti\TTL\ = TTL • (1 - P{T d < TTL}) 


1 


/TTL ■ H 0 • ( p c ■ R 0 + H 0 ) • e ^ (Pc-Ro+H 0 )-TTL 


Pc - Ro V 
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Pc ■ Rq + Hq ■ e Px-(Pc-Ro+H 0 )-TTL 
Pc - Ro + Hq 


px- p c - Rq \Pc ■ Rq + H 0 ■ ev*-(Pe- R o+ H o) TTL , 

Substituting the value of P{T d < TTL} from Result [I] in the 
above equation, after some algebraic manipulations, we can 
successively get 

TTL ■ ( p c ■ Rq + Hq) 


E[Ti\TTL\ = 

1 


Pc ■ Rq 


+ 


Px-Pc- Rq 
1 

Px- Pc- Rq 
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•In 1 + 
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Pc - Rq - e -P\- ( ,Pc-Ro+Ho)-TTL 


Hq+Pc-Rq- e -^-(Po- R o+H 0 )-TTL 

which is the expression of Result |2] for p c > 0. The expression 
for p c = 0 follows after taking the limit (p c —> 0) of the above 
expression. □ 

B. Lemma \2} Cost Monotonicity with Aq 
Lemma 2. Under a content placement policy given by 
Eq. (1191) . the derivative of the total cost, YleeM > with 
respect to Aq is 

if l S 
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dAo 


E ° 6 


i - 


7 


1 + 


•m>o 


(29a) 


L6£M 

where A = {9 € M. : L < Rq < U}. 

Proof. From the conditions (b) and (c) (see, proof of Result[4}. 
and similarly to Eqs. CODi, we can express the multipliers Aq 
and pe as a function of Ao, as 

Aq + CW , Rq < L 

0 , R 6 q > L 

-Aq - C bh (1 - 7 • $ • e-T Ns °R e 0 ) ,R S 0 >U 
0 ,Rq<U 

(29b) 

The cost of a single content dissemination, Eq. ( IT6b . under 
the content placement policy of Eq. ( IT9l ). can be written as 

->8 
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Taking its derivative, with respect to Ao, gives 


as following 
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(31) 


because the terms including only the scenario parameters (R 9 , 
7 , and costs) do not depend on the selected resource allocation 
and, thus, on the parameter Ao- 

To calculate the derivatives appearing in the right side of 
Eq. ED, we use the definition of a derivative, i.e. 


df(X 0 ) = Um /(Ao + dX 0 ) - /(Ao) 


dXo d Ao—>0 

and proceed as following: 
We first define the sets 


dXo 


(32) 


and, respectively, for Ao —> Ao + dXo, the sets 

A = {6 £ M : L + AL < R 9 0 < +AU} 
B = {6 £ M : R e 0 > U + AU} 

C ={B £ M : R e 0 < L + AL} 

where we denoted 
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The derivatives in the above sum are calculated as following 
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Regarding the first derivative term in Eq. ED . we proceed 
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Substituting Eqs. (l40t in Eq. ( l39t . gives 


d 

dX 0 


5>(7-*-flg) 

LfleC 

M ■ 1 


d 

dAp 


p(L) ■ In 1 + 


Ao \ 


E (Aq — Ae + ^ 0 ) 


.flex 




(40c) 


(41) 


7 ‘ CW ’ $ 

Substituting Eqs. (071) in Eq. (f36l) . gives 
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A in Eq. (OH . proves the Lemma. □ 


dXo 


EM 1 


.flGAt 


Ao — As + pe 

$1 


= \A\- 


*1 1 +ti¬ 


ps) 

Regarding the second derivative term in Eq. (OH . we 
proceed as following 
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.flee 

d\A\ 

dXo 

■§^+7 ■ Cbh • * ■ e~T N8 ° ■-£- 
a\o clAq 


E^ 

.flee 


« d|C| „ _ d 

— ■ 37 -1- 7 • CW • $ • -JT~ 

dXo dXo 


E< 

.flee 


(39) 


Similarly as before, we get 
d\C\ 


. _ M - 

g?Aq 7 • Cb/t • 4* 


■p(L) 


(40a) 



















































